Search CORE

25 research outputs found

End-to-End Simultaneous Speech Translation

Author: Ma Xutai
Publication venue: 'The Busan Gyeongnam Mathematical Society'
Publication date: 30/01/2023
Field of study

Speech translation is the task of translating speech in one language to text or speech in another language, while simultaneous translation aims at lower translation latency by starting the translation before the speaker finishes a sentence. The combination of the two, simultaneous speech translation, can be applied in low latency scenarios such as live video caption translation and real-time interpretation. This thesis will focus on an end-to-end or direct approach for simultaneous speech translation. We first define the task of simultaneous speech translation, including the challenges of the task and its evaluation metrics. We then progressly introduce our contributions to tackle the challenges. First, we proposed a novel simultaneous translation policy, mono- tonic multihead attention, for transformer models on text-to-text translation. Second, we investigate the issues and potential solutions when adapting text-to-text simultaneous policies to end-to-end speech-to-text translation models. Third, we introduced the augmented memory transformer encoder for simultaneous speech-to-text translation models for better computation efficiency. Fourth, we explore a direct simultaneous speech translation with variational monotonic multihead attention policy, based on recent speech-to-unit models. At the end, we provide some directions for potential future research

JScholarship

An Analysis of Source Context Dependency in Neural Machine Translation

Author: Koehn Philipp
Li Ke
Ma Xutai
Publication venue: European Association for Machine Translation
Publication date: 01/01/2018
Field of study

The encoder-decoder with attention model has become the state of the art for machine translation. However, more investigations are still needed to understand the internal mechanism of this end-to-end model. In this paper, we focus on how neural machine translation (NMT) models consider source information while decoding. We propose a numerical measurement of source context dependency in the NMT models and analyze the behaviors of the NMT decoder with this measurement under several circumstances. Experimental results show that this measurement is an appropriate estimate for source context dependency and consistent over different domains.This work was partially supported by the IARPA MATERIAL program

Repositorio Institucional de la Universidad de Alicante

Findings of the IWSLT 2021 Evaluation campaign

Author: Anastasopoulos Antonios
Bojar Ondřej
Bremerman Jacob
Cattoni Roldano
Elbayad Maha
Federico Marcello
Ma Xutai
Nakamura Satoshi
Negri Matteo
Niehues Jan
Pino Juan
Salesky Elizabeth
Stüker Sebastian
Sudoh Katsuhito
Turchi Marco
Waibel Alexander
Wang Changhan
Wiesner Matthew
Publication venue
Publication date: 01/01/2021
Field of study

KITopen